ON TEMPLE-KATO LIKE INEQUALITIES AND APPLICATIONS 

LUKA GRUBISIC 

Abstract. We give both lower and upper estimates for eigenvalues of unbounded positive def- 
inite operators in an arbitrary Hilbert space. We show scaling robust relative eigenvalue estimates 
for these operators in analogy to such estimates of current interest in Numerical Linear Algebra. 
Only simple matrix theoretic tools like Schur complements have been used. As prototypes for the 
strength of our method we discuss a singularly perturbed Schroedinger operator and study con- 
vergence estimates for finite element approximations. The estimates can be viewed as a natural 
quadratic form version of the celebrated Temple-Kato inequality. 



1. Introduction 

The purpose of this article is to establish scaling robust estimates for discrete eigenvalues of 
positive definite operators in a Hilbert space. We also prove that our estimates are optimal for 
a residual type analysis of lowermost eigenvalues of those operators. Our approach uses the 
theory of quadratic forms from [21, Chapters VI- VIII] and an adaptation of the matrix relative 
perturbation theory. As a result we establish the same high performance residual type estimates 
from imi in our more general setting. For a review of the matrix relative perturbation theory see 
ll24l and the references therein. 

It turns out that positive-definiteness of the matrix is the key structural property which is 
needed for the analysis of [III [121. Subsequently, we prove our estimates for an abstract pos- 
itive definite form in an arbitrary Hilbert space. This, together with the fact that the estimates 
also hold for the discrete eigenvalues which are in gaps of the essential spectrum, indicates that 
our simple matrix analytic techniques are well adapted to the class of problems under study, e.g. 
our technique yields high performance estimates without forcing us to impose any unnecessary 
restrictions. This abstract approach is further justified by the fact that we simultaneously con- 
sider applications of these estimates to a study of the convergence properties of adaptive finite 
element methods as well as to a quantitative study of the asymptotic properties of eigenvalue 
problems in the large coupling limit. Typical operators in the large coupling limit setting are 
those from SISIIII. 

The obtained estimates are the same as those which have proven themselves in [fT2| as a 
significant tool in the development of modern mathematical software. The main feature of the 
matrix eigenvalue algorithms from [fT2ll is that they are robust when applied to extremely badly 
scaled input matrices. We bring this in correspondence with the behavior of the spectrum of 
stiffly/singularly perturbed operators from L3,i8,29J. We use a model Schroedinger operator to 
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show that our estimates are optimal for the class of stiff/singular perturbations, see Section HI 
An extensive study of non-inhibited stiff families of operators has been performed, with the help 
of the results from this article, in |[T6l and will be published in the subsequent report. 

When studying the convergence of finite element procedures we adopt the approach of ini,l25l. 
Our results give a new flavor to the analysis of the stoping criteria for preconditioned inverse 
iterations from [25, Section 4]. In comparison, we are more explicit about the dependence of 
the ingredients of the error on the input data and we can prove the equivalence of the error and 
the estimator, see Section [51 Furthermore, in the Conclusion we briefly outline a simple way to 
obtain optimal eigenvector estimates from our eigenvalue results. 

In Section [TTj we relate our new results — which appear in Section [3] — to other results in the 
literature. In Section 11.2! we use a simple matrix model problem to give a first flavor of the 
results by comparing our approach with that of the Temple-Kato inequality. 

1.1. A comparison with other approaches. A number of recent studies of the eigenvalue 
approximation problem — when classified from the viewpoint of perturbation theory (see e.g. 
[[T9ll24l ) — could be seen to fall into the following classes: 

(1 ) Results obtained in the absolute(sometimes also referred to as regular) setting. For recent 
results see [|26l and references therein. The performance of such estimates, when applied 
to an unbounded operator, depends on the method of the regularization. The delicacy of 
this issue is illustrated on an example below. 

(2) Results which are obtained by interpreting the eigenvalue problem as a nonlinear prob- 
lem (in both eigenvalue and eigenvector). This approach makes a treatment of the eigen- 
value multiplicity somewhat more difficult, since it is not easy to profit from the special 
structure which the eigenvalue problem has, cf. [18, Remark 7]. 

(3) Direct analysis of a representation of the single vector residual r = Htjj — f^-ip, coupled 
with a consideration of the approximation properties of the function space which is used 
to generate the test vector ijj, WipW = 1, where we have used /x := {ijj, Hi/j) to denote the 
Ritz value. Such estimates are essentially asymptotic in nature (cf. [[IT, Remark 3.2]) 
or are specifically tailored for the particular (class of) problem(s) under study (cf. [|23l 
Theorem 4.1]). 

We propose a technique which is based on the relative perturbation theory for quadratic forms 
in a Hilbert space from [|2T1 Chapters VI- VIII]. In this regularization framework (which was 
developed in [[151 [TTl ) we solely use elementary matrix techniques, like LU-decomposition and 
Schur complements, to obtain new block operator residual equation. This residual equation 
has the same form as the corresponding matrix result from [11], but holds in this more general 
setting. It appears to be better suited to dealing with eigenvalue multiplicity than are other 
approaches. We also argue that when dealing with the lower part of the spectrum of the positive 
definite operator our choice of the regularization is optimal. 

Specifically, we follow the approach of [7l[Tni|Tl] and reverse the trend to show that these 
finite dimensional results have a lot to offer in the original setting of [21], and in particular as 
tools for a numerical study of singularly perturbed (integro) differential operators. This paper 
will heavily use the general construction from |[T5l [TTl . In short we propose, assuming we are 
given a positive definite and self-adjoint operator H and the orthogonal projection P: 
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• Construct the positive definite operator H' from [fTSlfTTl . H' is the block diagonal part 
of H with respect to P and it is given formally as H' = PHP + (I - P)H(I - P). 
In this paper we shall use the notation Hp = H' to emphasize the dependence on the 
projection P. 

• Scale the perturbation H — Hp with Hp^^^ to obtain the bounded operator 6H^ : = 
"Hp^^^(H — Hp)Hp^^^". This is equivalent to working with the perturbation H — Hp 
in the dual energy space which is associated to Hp. 

• Apply an adapted result from BTllTOlIIIl to obtain the desired eigenvalue/ vector estimate 
in the quadratic form setting. 

At the end, we would like to emphasize that the original matrix inequalities from [TO [121 have 
been extensively tested in the process of developing new finite precision eigenvalue software. In 
the course of this testing a large body of test examples has been generated by judicious random 
searches as well as by a modification of the known examples from science and engineering. The 
inequalities have been found to be numerically sharp, as is reported in [T2], on numerous test 
matrices. 

1.2. Relationship to Temple-Kato inequality. The central theme of this paper is the issue 
of how to regularize an unbounded eigenvalue problem to obtain an object which can then be 
algebraically studied. As an introduction to the issue we shortly review other approaches with 
special emphasis on the regularity issues. This section is meant to provide the motivation for 
this study and it extends the introduction. 

The history of a posteriori eigenvalue approximation estimates goes back to [|20l |32l . Such 
inequalities (most recently studied in [,26, |) have a general form of 

(1.1) |eRROR| < CONDITIONING X ||RESIDUAL || ^. 
Our estimates will have the form of 

(1.2) III RESIDUAL |||r^g,< |rel(ERROR)| < rel(CONDITIONING) X ||| RESIDUAL |||j^gi, 

where the measures ||| . |||rei and | rel (error) | and rel (conditioning) denote the appropriate in- 
gredients from matrix (relative) perturbation theory as given in [|24l . 

Let us now be more precise. We shall always work in the background Hilbert space H, which 
is equipped with the scalar product (■, ■) and the norm || ■ || = (-, ■)^/^. Let H be a self-adjoint 
operator which is bounded from below and let ^ be some vector of the norm one in its domain 
of definition Vili). Define the Rayleigh-quotient fx := (^,H^) and assumeQthat A2(H) > /i 
then classical Temple-Kato inequality from [28, Theorem VIII. 5, Volume IV pp. 84] reads 

(1.3, ^_ W.H,,)-W.H^.)% ,,(H)<^, 

A2(H) - ij, 

The vector r = Hip — fiijj is called the residual (associated to ijj) and it holds (Hip, Hip) — 
(■ip, Hipy = II r p. Now it is easy to see that (11.31) implies 

i|H7/;-/i^|p 



(1.4) fi-Xi< 



A2 - 



^We are counting the eigenvalues, which are below the infimum of the essential spectrum, in the ascending order 
according to multiplicity. 
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which has a general form of Here we have used Aj = Ai(H), i = 1, 2 to simplify the 

notation. The norm ||r|| can be seen as an "approximation defect" of the vector -ip, since -ip is an 
eigenvector if and only if ||r|| = 0. 

As already stated our estimates have a similar general form (see (11.21 )) to the Temple-Kato 
inequality but are obtained under the assumptions of the perturbation theory for symmetric forms 
from [|2T1 Chapters VI- VIII]. A consequence of this is that, in a case of a positive definite 
operator H, we are able to directly work with test vectors from the domain of the symmetric 
form which is according to [2T, Theorem VI-2.23, pp. 331] equal to ©(H^/^). Our version of 
(fT31) also assumes /i < A2(H) but allows e V(H^/^), \\tp\\ = 1 and establishes (Theorem [Ml) 
the estimate^ 

^■^ ii^ii^-i - /i -A2-/i mi,-. ' 

where || ■ ||h-i := ||H^^/^ ■ || is the classical H^^-norm. To recognize the importance of the 
original Temple-Kato approach, as well as in line with the terminology from Numerical Linear 
Algebra, see [|25l pp. 271], we call all the inequalities which have the form of (11.11 ) or (11.21 ) 
Temple-Kato like inequalities. By Temple-Kato approach we mean the notion that high per- 
formance eigenvalue estimates should be obtained as a mixture of the a posteriori computable 
measure of the approximation defect ||r||^ and the a priori assumed quantitative information 
1/(A2(H) -/i) = max{|A-/ir^ : A G S(H) \ {Ai(H)}} on the conditioning of Ai(H). 

Let us now discuss (11.51) . The measure of rel(cONDlTlONlNG) is in this context the so called 
relative gap (A2 — /^)/ (A2 + /i), which distinguishes close eigenvalues better than does the abso- 
lute gap (A2 — /i) from (11.31 ). Furthermore, both the residual measure ||H^^ — V^||h-i/II^IIh-i 
as well as the relative gap are robust with regard to scaling (e.g. "dimensionless quantities"). 
Thus, the most important message of (11.51) is the same as in [fT2l Example 2.1]: The approxi- 
mation 11 has completely resolved the eigenvalue Ai when the (relative) residual measure drops 
below the relative gap. We also note that (11.51) . unlike (11.31 ) cannot give negative lower bounds 
to eigenvalues of positive definite operators. On the other hand, the H^-norm is more difficult 
to evaluate than are the ingredients of (11.51) . For a possibility to do this see Remark |3^ Section 
IHand Em [161. Approximations to norm of the residual can also be computed in a more 
accessible scalar product, see lUOl Remark 7]. Note that the restriction ip E ^^(H) from (11.31) 
excludes — without prior regularization of the problem — the case when ^ is a continuous piece- 
wise linear function and H = — A is the negative Laplace operator. On the other hand, in this 
case is our theory directly applicable and working estimates are explicitly given, accompanied 
with an argument for their optimality. Furthermore, as an illustration of our matrix theoretic 
approach to unbounded operators, we will show (in Section |2l) a "matrix analytic" way to obtain 
a variant of the original Temple-Kato inequality. 

We close this section by a simple and small numerical example which should illustrate the 
dichotomy between the (easy) computability and scaling robustness of eigenvalue estimates. 
We will be comparing the first order estimates (in the approximation defect) from [15ltl7| with 
the second order estimate (11.41 ). As a model we consider the asymptotic behavior of the family 

^The conditioning constant j^^^ is a deliberate overestimate of the optimal constant gi from Theorem l3.4l It is 
a classical ingredient of the relative perturbation theory, see lfT2ll24l . 
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of positive-definite matrices 
(1.6) H« 



101 101 

0^ Wo 



K OO. 



101 

The results of [fTSl [TTl . which in this specialization to the matrix can be obtained using 
[[TTl Theorem 1.1] and a direct computation, use the relative residual measure 

vW-= ITni ' = (V',H?/'). 

Ih-1 



Before we proceed note the following geometrical facts. It holds that ri{-ip) is equal to the 
sine of the angle Z(H^, ip) in the scalar product (■, H^^-). We denote this by writing 77 (^) = 
sin Z(H.ip, ^)h-i. It can also be shown that in the scalar product of the background space H 
the identity 'r]{ip) = sin Z(H^/^^, H"^/^?/;) holds. Furthermore, r](tp) is also an approximation 
defect measure, since r](ip) > and 'ip is an eigenvector of H if and only if r]('ip) = 0. 

Now, [[m Theorem 5.1] states that if for ip e T'(H^/2) the assumption r]{ip) < holds 
then 

(1.7) M!)^<,(^). 

This estimate is equivalent with 

(1.8) (1 - vW)f^ < Ai(H) < (1 + r7(^))/i. 

and since both ^ and r](ip) are computable relations (11.71 ) and (11.81) give both a lower as well as 
an upper estimate for Ai(H). 

Now, take ip = \l O] * as the test vector and compare Wr^W, = 'H.i^ip — fiip and r]K{ip), 
V^iiP) :=||H iV^-V^llH-i/ll^lln-. 



One computes Hr^H = whereas rjJip) = - , . This shows that the second 

order estimate from (11.41) . as opposed to the first order estimate (11.71) . does not detect that (/x — 
Ai(Hk))/ fJi = ^ ^1^2 + — ^-OasK^oo. For more details on a numerical comparison of 
(fO-(fr?l) and if^-fiTMsee (15', Table 1.1]. 

Caution has to be exercised when comparing absolute and relative estimates on this exam- 
ple. It is known that absolute and relative estimation theory can (sometimes) yield equiva- 
lent estimates, cf. [19]. For instance, in the case of a single lowermost eigenvalue Ai(H) 
an inequality which has a similar form as the righthand side inequality from (11.51) can be ob- 
tained if one applies (11.41) to the operator H^^ in the Hilbert space with the H-scalar product 
(., ■)h := (H^/^., H^/^.). However, such approaches which first derive the eigenvalue estimates 
in the background (absolute) scalar product and then scale the operator at hand to fit this frame- 
work do not provide a proof of the optimality of the estimates for parameter dependent problems. 
Furthermore, our approach is more natural for the treatment of the eigenvalue multiplicity which 
can be seen on the new block-operator residual equation which yields error estimates that utilize 
any unitary invariant norm of the block operator residual. Such a posteriori estimates and block 
operator residual equations did not appear before in the context of the eigenvalue estimation 
for unbounded operators. We also note that the numerical examples, reported in [11211 . indicate 
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that our choice of the (relative) residual measure r]{ilj), as well as the choice of the measure 
of the conditioning (relative gap) yield numerically sharp eigenvalue estimates. Bridging the 
relative perturbation theory from [jVl [TOl [HI with the theory of eigenvalue estimation for un- 
bounded operators is the declared aim of this work. In addition to that we will outline a new 
general framework for analyzing asymptotic exactness of eigenvalue estimators for parameter 
dependent eigenvalue problems. On our simple 3x3 example this general result reads 

M-Ai(H^) 

lim = 1 

and the rate of the convergence appears to be rather rapid. We will also show, by comparing 
the block operator residual equation which yields (11.41) with the residual equation that yields 
(|1.5I) . that the approach of the relative perturbation theory — e.g. first scale and then estimate 
rather than as in the absolute approach where one first estimates and then scales — is the right 
one when estimating the lower part of the spectrum of a positive definite unbounded operator. 

2. A PERTURBATION APPROACH TO RAYLEIGH-RITZ ESTIMATES 

We follow the general notational conventions and the terminology of [21 , Chapters VI- VIII]. 
Minor differences are contained in the following list of notation and terminology. 

• 7i ... is an infinite dimensional Hilbert space, can be both real or complex 

• ( • , • ) ; 1 1 ■ 1 1 . . . the scalar product on H, linear in the second argument and anti-linear (when 
Ti. is complex) in the first; the norm on H 

• Til ® li.2--- the direct sum of the Hilbert spaces Tii and I-L2, for any a; G TYi © 7-^2 we 

have X = xi® X2 = for Xj G Hi, 2=1,2 

• S(H), Sess(H); Acss(H) ... the spectrum and the essential spectrum of H; the infimum 
of the essential spectrum of H 

• A < B ... order relation between self-adjoint operators (matrices), is equivalent with the 
statement that B — Ah positive 

• C{l-i)\ CiHi, 7-^2)"- the space of bounded linear operators on H, which is equipped with 
the norm || ■ || ; the space of bounded linear operators from Hi to H2 

• R(X), N(X) ... the range and the null space of the linear operator X 

• P, P±... the orthogonal projections P and Pi := I — P 

• Jq ... a permutation of N 

• diag(M, W) ... the block diagonal operator matrix with the operators M, W on its diag- 
onal. The operators M, W can be both bounded and unbounded. The same notation is 
used to define the diagonal m x m matrix 

diag(ai, ■ ■ ■ , am), with cti, ■ ■ ■ , am on its diagonal. 

• Si{A) > S2{A) > ■ ■ ■ , Smax(A), SminiA) ... the singular values of the compact operator A 
ordered in the descending order according to multiplicity, the minimal (if it exists) and 
the maximal singular value of A 

• III X III ... a unitary invariant or operator cross norm of the operator X. Since ||| ■ ||| 
depends only on the singular values of the operator, we do not notationaly distinguish 
between the instances of the norm ||| • ||| on C{H), C{R{P)), C{R{P), R{P)-^), or such. 



ON TEMPLE-KATO LIKE INEQUALITIES AND APPLICATIONS 



7 



Precise properties of a unitary invariant norm will be listed in Section [3l for further 
details see lf30l . 

• tr{X), III X \Ihs ■■■ the trace a the Hilbert-Schmidt norm of the operator X, it holds 



As a general policy to simplify the notation we shall always drop indices when there in no danger 
of confusion. 

We will generically assume that we have a closed, symmetric and semibounded from below 
form h with the dense domain Q{h) C H as given in [HI (VL1.5)-(VI.1.11), pp. 308-310]. 
The form h which has a strictly positive lower bound will be called positive-definite. This 
is also a small departure from the terminology of {TP, Section VI.2, pp. 310]. Such form h 
defines the self-adjoint and positive definite operator H in the sense of [21, Theorem VI.2.23, 
pp. 331]. Furthermore, the operator H is densely defined with the domain ViH) C Q{h) 
and ©(H^/^) = Q{h). We also generically assume that H has discrete eigenvalues Ai(H) < 
• ■ • < Am(H) < ■ • • < Acss(H), where we count the eigenvalues according to multiplicity. 
Another departure from the terminology of Kato is that we use h('ip, (p) to denote the value of h 
on ^, G Q{h), but we write h[ip] := h{ijj, tp) for the associated quadratic form h[-]. We also 
emphasize that we use ■* to denote the adjoint both in the real as well as in the complex Hilbert 
space H as is customary in [21, Chapters VI- VIII]. 

Let us now fix our Rayleigh-Ritz terminology and outline the main construction from |[T5l[T7l . 
We assume that we are interested in approximating the eigenvalue A(H) of finite multiplicity 
m G N. Instead of only one test vector, as was the situation in (11.71) , we now need a test 
sub space of dimension m. 

Let therefore P be an orthogonal projection such that dim R(P) = m and R(P) C Q{h). We 
call R(-P) the test subspace for (the approximation of) A(H). The operator S G £(R(P)), H = 
(H^/^P|r(p))*H^/^P|r(p) will be called the (generalized) Rayleigh quotient. Its eigenvalues 
A^i < ■ ■ ■ < l^m will be called the Ritz values from the test subspace R(P) and the vectors 
Ui G R(P), Huj = HiUi, \\ui\\ = 1 will be called the Ritz vectors. We also define the operator 
W : R(P)^ R(^)^ as the one which is defined in R(P)^ by the form h{P±-, P±-) in the 
sense of fIV. Theorem VL2.23, pp. 331]. 

2.1. A variant of the Temple-Kato inequality. Before we outline the main form theoretic 
construction from [fTSlfTTl let us illustrate our "matrix theoretic" approach to spectral theory by 
proving a variant of (11.11 ) in a case when Ai(H) has a finite multiplicity m. 

Let H be a self-adjoint operator which is bounded from below. Let further, counting the 
eigenvalues according to multiplicity, A := Ai(H) = Am(H) < Am+i(H), A 7^ and we 
assume that we have a test subspace R(P) C T)(li), dim R(P) = m. The environment space 
Ti. can be decomposed as H = R(P) © R(^)"'" and H can be represented as a block-operator 



Whs 



v/tr(X*X), see ^ 




(2.1) 



H 



S K* 
K W 



^For more on block operator matrices see for instance [|6l. 
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where K G C{R{P), R(P)^), K = P^HP^^^. Using the standard resulj] [161 Theorem 5.1] 
one obtains that there exist m-eigenvalues Xj^ (H) < ■ ■ ■ < Xj^ (H) such that 

IXjX'H.) - jJ'il < \\K\\, i=l,...m. 

If we further assume that \\K\\ < Am+i(H) — fim, then 

|Aj(H) — < ||i^r||, i = l,...m. 

follows from the perturbation construction of f6^, Theorem 5.1 and Remark 5.2]. Furthermore, 
the spectral calculus for the self-adjoint operator W and the min-max formulae yield 

^m+l[tl) — fJrn— \\J^ \\ 

The assumption R(P) C T>{ii) allows us to justify the following matrix representation 



H-AI 



I 









I 





H - AI - K*B2K 



I 
I 



Babs:=(W-AI). 



Babs 

We can now use a generalization of the so called Wilkinson's trick from [|27l pp. 183], which 
will be stated explicitly as Theorem 13 . 1 1 below, to conclude that 

m = dim N(H - AI) = dim N(diag(S - AI - K*B-lK, B^bs)) 

and this can only happen if 

(2.2) E- Xl = K*(W ~ Xiy^K . 
Furthermore, we establish 

(2.3) III diag(/.. - A) |||< -— i III K ||| ||i^||, 

Am+l(-tlj — flm — ||A II 

where ||| ■ ||| is any unitary invariant norm. A similar inequality holds for discrete eigenvalues 
which are located in the interior of S(H). In particular, if we denote the (single) Ritz vector 
residuals by := Hui — fXiUi and apply the trace operator tr(-) on (12.21) . we obtair@ 



^ Am+i(H) - /i„ - ||K|| ^ 

This generalizes the estimate from (11.31) to the case in which Ai(H) has the multiplicity m. The 
quotient ^ +i(h)-^ -\\k\\ numerically inferior to ^ +i(H)-At ' (12.31) holds for any unitary 
invariant norm. This is a mechanism which allows us to individually treat Ritz vectors of differ- 
ent approximation properties. Furthermore, in view of the discussion from |31 pp. 8] estimates 
(12.31) give significant new information when compared just with (12.41) . It could be argued that 
-r jTjT^ uTTJi is a reasonable ingredient of the estimates since, in typical situations, one uses 



^Also known as Kahan's residual theorem in the case of the test subspace R{P) of Krylov-Weinstein inequahty 
in the case of one test vector V', see H l27ll . 

^ The inequality i2.4\ appeared with a better gap estimate in the bounded operator setting in ||26l . whereas 
the inequahty ( 12.31 ) appeared in the matrix setting in |31|. The significance of this inequalities in this paper is to 
introduce the Schur complement technique which will be the main tool later. 
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ll6l Theorem 5.1 and Remark 5.2] to obtain a bound for -r 4jt . Also, see the discussion on 

JH pp. 305]. For a way to compute ||| K ||| see (H Section 9]. 

2.2. The symmetric form approach. The previous computation can not be justified in the case 
in which R(P) C Q{h) = ©(H^/^) but R(P) ^ V{U). Precisely this is the case in which we 
are interested. 

Let us now outline the main perturbation construction from [[TSlfTTl . We start by defining the 
positive definite form 

hp{u, v) = h{Pu, Pv) + h{P±u, P±v), u,v e Q{h) 

and the self-adjoint operator Hp which is defined by hp in the sense of [[271 Theorem VI- 
2.23, pp. 331]. The operator Hp and the form /ip are called the P-diagonal part of H and h, 
respectively. We also define the form 

Sh^{u, v) = h{Pu, P±v) + h{P^u, Pv), u,v E Q{h), 

which is an approximation defect in R(P), since R(-P) is an invariant subspace of H if and only 
if 5h^ = 0, for a proof see |[T7I . Furthermore, it was shown in ||T5l[T7ll that 

(1) R(P) reduces Hp 

(2) S = PHpP|,(^) 

(3) R(H^^ — Hp^) is finite dimensional which implies that 
Eess(H) = Sess(Hp) according to the Weyl theorem. 

The properties 1), 2) and 3) imply that yUj G S(Hp) and Acss(H) = Acss(Hp) together with 
the assumption fii < Aess(H) yields that yUj are the eigenvalues of the operator Hp with finite 
multiplicity. We are setting the scene for an application of the relative perturbation theory from 
|[2n Chapters VI- VIII] and so we will be able, regardless of the fact thaj] V{H) ^ V(Hp), to 
interpret H as a perturbation of Hp and thus bring /ij in connection with some component of 
S(H). 

This was the main line of argument in |[T5l[T7l . Although some of the technical results about 
Hp, which we shall now state are not explicitly given in [[T5l[T7 1 we present them here without 
proof. However, all of their proofs are obtainable as minor modifications of the arguments from 
|[T5l[T7ll and do not bring any new information. 

Let us now look into the structure of this construction in more detail. According to [[T5i 
Theorem 4.5] the form (■, ■) := 5/?.^(Hp^^^-, Hp^''^-) defines the bounded operator 6H^ 
and 

.... u.rrPu (V;,H-V)-(^,HpV) 

(2.5) \\dH^ = max . 

^GR(P) (V',H-V) 

To examine 5H^ in further detail define 

(m \ ■ f (^,H-V)-(^,H-V) I , ^ ^ „ ,„ ^.]V2 

dim(cS)=r?i— i+l 



^In fact, it is even possible that 'D{'H.) n I?(Hp) = {0} and the form approach is still applicable. 
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for z = 1, . . . , m. Obviously, = r]rn{P)- A more detailed assessment of the prof of [fTSl 

Theorem 4.1] yields the following lemma. 

Lemma 2.1. Let h be positive definite, and let R(P) be the test subspace such that dim R(P) = 
m. Assume further that 

Vr+l{P) < Vr+2{P) < ■ ■ ■ < Vm{P) 

are all nonzero rji (P) from MM . Then rjm{P) < 1 and ir^^+i (P) , . . . , ±77,^ (P) are all non-zero 
eigenvalues of6Hf. Furthermore, ?7,.+i(P), . . . , rjm{P) are all non-zero singular values of the 
operator Ks = 6HfP\^^p^ G C{R{P), R(P)^). 

Proof. The proof of this lemma is implicitly contained in the proof of [15, Theorem 4.1]. We 
leave out most of the technical details. We only explicitly present arguments that r]miP) < 1- 
This fact was first established, in the matrix case, by Z. Drmac, Zagreb. Since R(P) reduces Hp 
we have 

/i(HpV, v) = hpiUp'f, v) = if,v), ve R(P), 

i.e. Hp^/ is a Galerkin approximation from the subspace R(P) to = ii~^f, which solves the 
problem = /. With this in hand one computes 

(2.7) /i(H- V - HpV, a-' 9 - Hp' 9) = (/, a-' 9) - (/, iip'9), f,9e R(p). 

This implies {f,H-^f) > (/, HpV) > 0, / G R(P) and < r]i{P) < 1 follows. Assume 
VmiP) = 1, then there exists / G R(P) \ {0} such that (/, HpV) = 0. This is an obvious 
contradiction with the fact that Hp is positive definite. □ 

Now, set formally Ao(H) := 0, 0o 00 and define 

(2.8) Qg := min ||Ag(H) - : /i G S(Hp) \ {/ii, 

n\ t\ \ ■ f\+m(H)-/im /il-Ag_i(H)^ 

(2.9) 7, Ag := mm <^ -^^ — — , — — —\, 

lAg+^(H)+/i^ /ii + Aq_i(H)J 

for g G N. This quantities — which measure the sensitivity of the eigenvalue Aq(H) — will play a 
role in the statement of the theorems in the next section. In the rest of the section we suppress 
the dependence of quantities on H and P in the notation. 

We now relate 7s (Ag) and Qg to r]i. The main result of ifTSl established that given R(P) C 
Q{h), dim R(P) = m and /ij < Acss there exist m eigenvalues A^^ < ■ ■ ■ < such that 

— <r]m, i = l,...,m 

holds. Under an additional assumption on the location of the unwanted component of the spec- 
trum we can localize the approximated eigenvalues and obtain that if e.g. r7m(l — Vm)^^ < 

(Am+l ~ /^m)(Am+l + Hm) ^ then 

(2.10) — — —<r]m, i = l,...,m. 

This assumption is similar to the assumption of the Temple-Kato inequality (11.31 ). For higher 
eigenvalues we have the following variant of iTTSl Theorems 5.1 and 5.2], which we present 
without proof. 
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Lemma 2.2. Let R(-P) be the test subspacefor the positive definite form h and let dim R(P) = m 
and g G N. If -. < 7s (A„) then flo > an J in particula^ 



Qq > mill 



- r]J - (1 + T^)A,_i (1 - T^)Xg+m - (1 + 



3. TEMPLE-KATO like inequality in THE PRESENCE OF RiTZ VALUE CLUSTERS 

We now present the main contribution of this article. We will derive relative eigenvalue esti- 
mates in the presence of Ritz value clusters. 

In this section we will need to elaborate on the notion of the unitary invariant operator norm 
(also known as symmetric or cross operator norms, cf. [2T1I301 and the references therein). This 
will allow us to extract more information from R(P) than what is contained in r]m{P) = \\Ks\\. 
In this section we will be dealing with only one orthogonal projection P, and so we simply write 
r]i := r]i{P), i = 1, . . . ,m, whenever there is no danger of confusion. Furthermore, we write 
Aj := Aj(H), i G N to simplify the notation. 

To say that the norm is unitary invariant on 5 C C{H) means that, beside the usual properties 
of any norm, it additionally satisfies: 

(i) : If B eS,A,C e OH) then ABC G S and 

(3-1) III ABC |||< pll III B III ||C||. 

(ii) : If A has rank 1 then ||| A |||= ||A||, where || ■ || always denotes the standard operator 
norm on C{H). 

(iii) : IfAeS and U, V are unitary on H, then UAV G S and 
(3.2) III UAV 1=11 A III . 

(iv) : S is complete under the norm ||| • |||. 

The subspace S is defined as a ||| ■ |||-closure of the set of all degenerate operators in £(7i). Such 
S is an ideal in the algebra CiH), cf. f30\. 

A typical example of such a norm is the Hilbert-Schmidt norm ||| ■ 1\hs- A bounded operator 
H -.n ^Hisa Hilbert-Schmidt operator if H*H is trace class and then, cf. ^ Ch. X.1.3], 
III H \\\hs:= Vt^IPH = [ ZT=i Si{Hf Y'^ 

Before we turn to the main theorem, let us give an alternative — more common — definition 
for the approximation defects r]i{P). To this end we further exploit the Galerkin orthogonality 
property of the P-diagonal part of h and in particular the ramifications of relation (12.71 ) from 
Lemma [27T1 For any / G R(P) we have 

Il[ti /-Mp/J- sup , r ni/2 - l|W- /-/IIh-1- 

With this we can write (12.61 ) in an alternative form 



|2 



(3.3) ,„(P):=[ ^ma.^^ ™n | V, , 5. |W = 1} 

dim(cS)=m— i+1 



1/2 



^Here we assume — = 0, for c > 0. 
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for i 



m. 



3.1. Operator matrices and the Wilkinson's trick. As a first step we shall outline the Wilkin- 
son's trick and state our adaptation of this result as a theorem. This result yielded (12.21) . We shall 
then proceed to prove eigenvalue estimates. Let us now generalize the Wilkinson's trick to 
operator matrices, cf. [|27l p. 183]. 



Theorem 3.1 (Wilkinson's trick). Let A : Hi ^ Hi and X : H2 ^ Hi be bounded operators 
and let A be self-adjoint. Assume further that B : H2 H2 is self-adjoint and that it has 

A Xl 

, to be understood as operator on Hi © H2- If 



a bounded inverse and define M 
dim N(M) = dim T^i < 00 then 



X* B 



A = XB~^X*. 



Proof. We shall adapt the Schur-complement technique from [|27l p. 183]. Since B ^ is assumed 
to be bounded we can write 



XB-i 
I 



(3.4) M = 

Both of the operator matrices 

S = 



A - XB-^X* 





B 



I 

B~^X* 



SBS* 



XB-i 
I 



s- 



-XB 1 
I 



define bounded operators on Hi (SH2, and so D = S'^MS^* . This implies that V{M) = ©(D) 
and as a consequence of a simple dimension counting we obtain that 

(3.5) dim N(M) = dim N(D) < 00 



Since B has a bounded inverse (13.51 ) can only be true if A — XB ^X* 
Wilkinson's trick and it proves the statement of the theorem. 



0. This is the so called 

□ 



Remark 3.2. Note that the theorem remains valid if we only assume that B is injective and 
B~^X* is bounded. In this case we conclude that A = X(B^^X*). In the case when Hi is 
infinite dimensional the dimension counting cannot be used to prove the result. Some spectral 
properties of Schur complements in a general situation can be found in [[22|. 

Theorem 3.3. Let R(-P) be the test subspace for the positive definite form h, as defined in 



Section^ and let dim R(P) = m and g G N. Assume that Aq_i < 



A 



< ls{\) hold then 

(3.6) 

In particular, for ||| ■ ||| = ||| 
(3.7) 



q+m- 



-1 < A 



q+m 



and 



I - A,S-i 



|< — III diag(r7i, ■ ■ ■ ,rj^) 

09 



m 

E 

i=l 



Hs cind fii G Si 
(A,-^.)2iV2 



we have the estimate 



< 



ril + 



+ Vm 



1/2 
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Proof. Let the fomi /ip be the P-diagonal part of h. A modification of [15. Theorems 5.1 and 
5.2] implies that 



defines the bounded operator Hs{\q), which allows the operator matrix representation 



"1/2 



(3.8) 



Hs{\ 



I 



with respect to 7^ = R(F) © R(-P)"^. Now, Lemma O implies that I - A^W"^ is invertible 
and we may use the Wilkinson's trick to derive quadratic estimates (for some further technical 
details see Lemma [T2|) . In particular we have || (I — A^W^^)^^ || = < oo. Now temporarily 

set Brei := (I-A^W"^), then 







I 



and Theorem 13 . 1 1 yields 
(3.9) I 
Property (13.11) of a unitary invariant norm 



I-A,S-i) 







A,H 



-1 





Biel 



I 



I- A, 



Kill 



implies 

III II fl 



A,W 



□ 



We apply (13.21) and Theorem [ZTI on the last inequality to complete the proof. 

The estimate of Theorem 13.31 was an equality up to (13.91) . So, there is more information in 
(13.91) than we have used so far. 



Theorem 3.4. If < Am+i and Ai = A^ then 



(3.10) 
(3.11) 



diag(r/i, 



<lll I- A, 



— Aj 



|< — III diag(??i, 
01 

m 



< 



-E 



i=l i=l i=l 

Proof. The assumption /i^ < A^+i implies 0i > 0. Now, this combined with Lemma IZ2l yields 
(3.12) K:K, < I - AgS-i < g^'KlK,. 

The conclusions now readily follow by an application of a norm ||| ■ ||| and the trace operator tr(-) 



on 



dm. 



□ 



We now relate 0i to the standard relative gap which was extensively studied in the relative 
perturbation theory. The result od Lemma [Z2l can be easily improved if we concentrate on the 
lower part of the spectrum. This is a reasonable assumption. 



Corollary 3.5. Assume that Ai 



then Qi > ^"^^ and in particular 



Am < Am+1 and rjm < (Arra+1 — firn.){^m.+l + fJrn) ^ 



(3.13) 



<\\ I 



|||< 



Am+l + n 



diag(?7i, 
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Remark 3.6. Theorem 13 .41 illustrates why this "relative" or form approach to the Temple-Kato 
inequality is more natural than the one which yielded (12.21) . The operator Brei in the relative 
block operator residual equation (|3.9I) is such that I < 'B^J < Qi^I. In comparison in the 
absolute block operator residual equation (12.21) it only holds that < B^^^ and so no lower 
estimate is obtainable. In the more general case of Theorem 13.31 the bounded operator Bjei is 
indefinite but it is always boundedly invertible. For further discussion of the optimality of the 
Schur complement approach see Section |4~n 

Proposition 3.7. Let Xm < A^+i and let R(P), dim R(P) = mbe the test space for the form h 
such that 2rim < 1 then 

1 IITT 112 ™ \ 

Proof. Let Ui G R(-P), i = 1, m be the Ritz vectors. The proof follows from 

^ WH—Ui - Milln-i = ^ {ui, H'^Ui) - {ui, E'^Ui) \ = ^\ {ui, B.~^Ui) 

i=i i=i ^ i=i 



m 



l^i — Xi 



Note that according to (13.151) below, we have J2^iVi ^ Yl'iLi ^^""m ^n"'^^"~^ . □ 

Remark 3.8. The upper estimate in the setting of Proposition 13 .7 1 can be achieved by a repeated 
application of the trace operator and the estimate (|3.15l) to the identity (13.91 ). The estimate is 
rather technical an we leave it out. However, we emphasize that we can recreate the framework 
of [[T2l Proposition 2.3] completely. 

3.2. A relationship with standard H^-norm residual estimates. This section addresses the 
issue of the computability of r]i(P) by relating these quantities to the standard H^^-norm es- 
timates of the residuals associated to the Ritz vector basis of R(P). The proofs as well as the 
results are technical and as such can be skipped on the first reading. 

Let us now reconsider the identities (12.61) and (13.31 ) and note that they can be understood as 
generalized matrix eigenvalue problems. Assume Ui, . . . ,Um are the Ritz vectors from R(P), 
then for i,j = 1, . . . , m, we define the matrices 

fijj = hCH^^Ui — E^-^Ui, ii~''^Uj — E^^Uj) 
Relation (12.71) from Lemma [2n implies that i7 is a positive definite matrix and in particular 

Qii = \\B.—Ui - Ui\\li-i = ^WHui - fiiUiWli-i, i = 1,. . .,m 
D^<^ <{l + 'Di)D^, 
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where = diag(;U]^ \ . . . and Di = — Dfj)D^^^'^\\. Now, with the help of 

jii = ||H-Uj||^_i, we obtain 

m m II TT II 2 



2=1 1=1 -n- 



and so we conclude that 



^ m mm 

(3.14) Y^SZ^--"^ ^ 5^^' ^ JZ^^n/^i- 

' i=l j=l i=l 

Estimate (|3.14l) can now be written as (cf. Proposition l3.7l) 

1 ™ IITT l|2 IIXT 112 

(3-15) T-^Z^ iiHn-ii^ ^2^^^ ^Z^Ti^rp — 



^-1 

-1 



By a similar argument one can conclude that asymptotically (as P converges to the m-dimensional 
spectral subspace) we have as a heuristic 



C-.+i(^)- 



This indicates that r]i(P) represent a canonical choice of residuals from R(P), i.e. not defined 
by the Ritz vectors Ui but rather the vectors which are selected by the variational formulae (|3.3I) . 

Remark 3.9. The definition of r]i indicates that the problem of computing (or estimating) r]i 
requires the solution of the m x m positive definite generalized eigenvalue problem. Since 
m is the multiplicity of the eigenvalue of interest, the computational cost of the solution of 
such problem is negligible. The main problem is how to evaluate or estimate the moments 
{ui, H^^Uj), i, j = 1, . . . ,m without actually inverting the operator H^^. For some possibilities 
to do this see ffTl Section 3.], ^ Section 5.] or [10, Remark 8]. 



4. A SIMPLE NON-INHIBITED STIFF PROBLEM 

These estimates have been used in [[T6l to study a class of eigenvalue problems which is given 
by the family of positive definite forms 

(4.1) h^{u,v) = hh{u,v) + K'^h^{u,v), k large. 

The forms h), and are assumed to be symmetric, closed and nonnegative and we further 
assume that hi, + is positive definite in H and that Q{hb + he) is dense in Ti.. Family (14.11 ) can 
always be considered as a perturbation of h^ + h^ (after an obvious change of variable k) rather 
than as a perturbation of h^, and so we assume, without affecting the generality of results, that hf, 
is positive definite and densely defined. 

A detailed study of the spectral property of the families like (14.11) is beyond the scope of 
this article and will be reported in subsequent publication. We will now consider a very simple 
problem of this form, and note that (|4.1I) motivated the example (11.61) . Let -f^o [0, 1] and i/g (M+), 
M+ := [O, oo) be the standard Sobolev spaces. We also identify the functions from Hq[0, 1] 
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with their extension by zero to the whole of IR+ and write -f^o [0, 1] C i/Q(M+). Consider the 
family of positive definite forms 

(4.2) hi^{u,v) = / u'v'dx + K^ I uv dx, ti, f G /Jq (R+). 

By we denote the positive definite operator defined by /i^ in (|4.2I) . The operators con- 
verge in the generalized sense to the operator Hoo, which is defined by the form hoo{u,v) = 
Jq u'v' dx, u,v G Hq [0, 1]. For further details on this convergence see [16] and the references 
therein. We also formally write = —dxx + fi'^X[i,oo) and Hoo = —dxx- As a test function(s) 
we chose 

..ox /X fv^sin(fc7ra;), 0<x<l 

(4.3) .,(x) = |^_ j-^- .,eN. 

Note that here Ug G but Ug ^ ©(H^). The eigenvalues of the operator have to be 

described implicitly. Let H^.v'^ = X'^v'^, then v'^ G C^(]R+) is 



sm(V X'^x), < X < 1 

1 < X 



1 



and A*^ is a solution of the equation 



(4.4) - A-^ = -VA^ cot(V>). 

The quotient ^ can be represented (for k —>■ oo) by a convergent Taylor series 

A?° - A? 1 1 /I 1 A 1 /I 4^1 

(4.5) = 2- - 3^ + 8 - + -TT^ 10 - + -TT^ -7 + ■ ■ 



We directly compute r]l{uq) := 3^ and combine it with (14.41) and the first order estimate from 
(|2.10l) to obtain 

Theorems 13.31 and |3 .41 now yield 

.... 2 A?°-Af D(ft) + 7r2 2 10 1 

(4.6) < ^- ^ < ^ = — + ^0 - , K>5, 

3 + K- A?° - L)(k) - 7r2 3 + K 3/t Vk/' " ' 

which is a tight estimate on the behavior of ^^^o ^ ■ Similar estimates hold for other eigenvalues 
and eigenvectors, too. This example illustrates the "efficiency" of this a posteriori estimator. 
Furthermore, it indicates a role which is played by the first order estimates from [15] in the 
general theory. For some details of the computation see [[T6ll . The Schroedinger operators in 
higher dimensions have also been studied in [[T6ll . The estimate for 'rf{uq) can in this case be 
computed by a use of the advanced probabilistic techniques from [[HI or by a use of the boundary 
layer techniques from [i3| (naturally, under the assumption that the domain is finite). 
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4.1. A framework for proving the asymptotic exactness. Let us go back to Remark IX6l The 
conclusion of Proposition 13.71 does not appear to be completely satisfactory. The factor fii/ fXm 
limits its applicability to a couple of the lowermost eigenvalues of H. The true power of the 
Schur complement technique can be seen if we rewrite (13.91 ) as 

(4.7) I - XgE-^ = k;Ks + XgK;w-^/\i - XgW-^)-^w~^/^Ks. 

After applying the trace operator on (|4.7I) and utilizing Lemma IZTI we obtain 

spm 2 + y-m 2 

In our K dependent problem we use this to prove (cf. (14.51 ) and (14.61 )) 



lim — - — = 1 

Furthermore, we see why this convergence is pretty rapid. In a general situation we perform 
this analysis by comparing the singular values Si{Ks(K)) with Sj(W^^_ft's(/t)) and noticing that 
Si(yV^^Ks{K)) is of higher order in k~^. Here we have assumed an obvious modification of the 
block matrix representation (|3.8I) for the k dependent problem. Exploiting (|4.7I) in the general 
setting of (14.11) as well in as in the setting of finite element approximations is beyond the scope 
of this paper and is a subject of subsequent reports. 

5. Finite element computations 

As a further explicitly solvable model example let us consider the family of eigenvalue prob- 
lems 

— ip" — aip = w ip, 

(5.1) e'V(O) = H2n), 

eV(0) = ^'(27r), 

where 9 E [0, vr] and we chose « G M so that the eigenvalues remain positive. The weak 
formulation of (15.11 ) is given by 

h{^,Vi) = Xi{H){ilj,v,), ^eQ{h), 

where 

(5.2) hi^, 0) := / - ai^(l)) , Q{h) := \^,^' E H, e'V(O) = ^(27r)} 

Jo 

and H = L^[0, 2%]. The eigenvalues of the problem (15.11 ) as well as the Green function of the 
operator H, which is defined by (15.21) are explicitly known, see [28, Theorem XIII. 89, Volume 
4. pp. 293] and [28, Equation (XIII. 154), pp. 292]. In particular we have 

(5.3) Ai(H)= + A2(H) = (^)'-a, X,{H) = [l + - a, 

Ztt /vr Ztt 

(5.4) v,{t) = e-i-'^^y, V2{t) = e~i^y, Vs{t) = e'^'^^)' 



18 



LUKA GRUBISIC 



N 


estimate (13.1011 


III ^-^^P(v^) 111^ 


s estimate (l377l) 


40 
60 
80 
100 
120 


7.9540e-001 
5.1413e-001 
3.4389e-001 
2.4120e-001 
1.7671e-001 


7.9540e-001 
5.1413e-001 
3.4389e-001 
2.4120e-001 
1.7671e-001 


7.9558e-001 
5.1422e-001 
3.4393e-001 
2.4123e-001 
1.7673e-001 



Table 1 . The performance of the estimates (|3.10l) and (13.71) on the family of 
test spaces R(P(V^)) and for the choice of the norm ||| ■ ||| = ||| ■ Whs- The compu- 
tational details can be found in lfT6l Section 2.7.3, pp. 64]. 



anc 



(V^, H- V) = / dh G{h - t2)W2)<P{ti) dh 
Jo Jo 



Let us now choose 9 = n and a = 0.2499 for our numerical experiment. With this choice of 
parameter the problem (|5.1I) is almost singular and Ai(H) = A2(H). For N eN define the finite 
element space 

Vl^ = {iP\iPe C[0,2n],-^{0) = ^{27i),^is linearin Jp,p = 1,...,N}, 
where := and use 

(5.5) Mi(VL) := max min , i = l,2 

dimcS =dimV^ — i 

to define the Rayleigh-Ritz approximations to the eigenvalue Ai(H) = A2(H). Let also Mj(V^) G 
Vjf, i = 1,2 be two vectors of norm one for which yUi(Vj^) = h[ui{Vli)], i = 1,2 holds. 
Now, let P(V^) be an orthogonal projection onto the linear span of {Mi(Vj^), M2(V]^)} and set 
Sp(vi^) = Hp(vir)-^(^Af)- ^^'^ ^PPly Theorems 13.31 and [34l on the projections P(Vj^) and 
display the results on Table [IJ 

5.1. Hierarchical error estimation. The results from Table [T] show that r]i, . . . ,T]m accurately 
capture the behavior of the relative error as 0. The explicit knowledge of the Green 

function is most certainly an information which cannot in general be assumed when considering 
higher dimensional eigenvalue problems. Let us now consider an application of these estimates 
in the context of the adaptive finite element methods for divergence type elliptic self-adjoint 



^We implicitly assume that H = L^[0, 27r] 
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operators in dimension two. We only present a feasibility argument, an algorithmic development 
will be a subject of a subsequent report. 

For the sake of definiteness let Ti = L'^iJZ), where TZ is assumed to be a bounded polygonal 
domain and let 

(5.6) h{u,v) = [ {VuyVv, u,v e Q{h) = H^{n). 

Jn 

By H^(1Z) = {u G H : + ||Vm|P < 00} we denote the standard first order Sobolev 

space. The gradient V is meant in the weak sense and || • || denotes the norm on L'^{TZ) and 
Hq{TZ) C H^{TZ) is assumed to be equipped with the norm \\u\\e ■= ||Vm|| = and it 

consists of those H^{1Z) functions which vanish on the boundary of TZ in the sense of the trace 
operator. 

The set is called a triangulation of the polygonal domain TZ if it consists of the triangles 
such that union of these triangles is TZ and such that the intersection of two such triangles either 
consists of a common side or of a common vertex of both triangles or is empty. By d we denote 
the maximal diameter of all triangles in 7^. For a given triangulation 7^ we define the finite 
dimensional function spaces: 

V\ = [u e Q\u e C{TZ) and is linear ,K eTd} 

Vd = {u e Q\u e C( ^ ) and f is quadratic ,K eTd} 

and the orthogonal projections Vi^d, i = 1,2 such that R(Vj d) = V^, i = 1,2. To simplify the 
notation we write Hj^ := Hy. ^, i = 1,2 and also define the orthogonal projection Pd such 
that R(Pd) equals the linear span of {ui ^, W2,d}- In what follows we assume, as in [9], that Td is 
graded and shape regular family of triangulations and that it satisfies the nondegeneracy property 
Il9l Assumption 4.1]. Let us assume that we have 

(5.7) /ii d ■■= max | min M^I^ : S C V}, dimS = dimV] - (i + i = l,2 

'-Ve-s\{o} [iJ,iJ) 

and Ui^d G Vj-^, i = 1, 2 are chosen so thai^ Hi,rf Ui^d = lJ'i,d Ui,d, i = 1,2. The result flU Theorem 
1.1] and in particular the last remark on [9. pp. 12] yield the estimate 

h[^'^Ui,d - a^^d^i^dV^^ IIH-v^i-HsXdlU ^ . ^ ^ 

(5.8) 3i r ~ ~i — n — n ^ ct^ i = 1,2 

with the constant a which depends solely on the shape regularity of 7^. Set rj ^ = ^^24^14 — 
Hi^Ui^d- Combining (15.81 ) and |l2l Estimate (2.16)] we conclude that there exists constants C^: 
and c*, solely depending on the shape regularity of Td, such that 

(5.9) j}^}^ < ^^^4^^4ru-^ ^ a^^^!:^, , = 1,2. 

This estimate can now be directly plugged into the trace type estimates from Theorem 13.41 or 
Proposition 13.71 Furthermore, Remark 13.91 allows us to exploit other unitary invariant norms 
with similar ease. 



^This can be checked by a direct computation. 
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Remark 5.1. Note that H2,d|v2 G jC{Vj). The norms of the residuals r^ ^ G V|-^ can 
efficiently (cheaper than when solving a linear system) be approximated as functions of the 
vectors Wi^d G V|-^ V-f^, 2 = 1,2, as given by Theorem 2.1]. Similar consideration has been 
explored in f25^, Estimates (29)-(30)] (cf. [T, Theorem 2.2]), but in comparison our estimates 
give more explicit information on the dependence of the constants on the mesh and provide the 
optimality argument, too. 

To summarize, the arguments of Remark [X9] indicate that it is possible to estimate the 
norm of the residual cheaper than it takes to solve the linear system. Furthermore, we have shown 
that when deciding on the convergence of the finite element method the size of 

"■2,d 

should be compared to the relative gap measure Qg to decide if the approximation is good enough. 
By this we mean if the whole multiplicity of the target eigenvalue has been resolved by R{Pd)- 
To get a feeling for this statement one should remember the picture of [il2i, Example 2.1]. 

6. Conclusion 

The main benefit of our approach is that, as the theoretical considerations from Section |4] and 
Table [H corroborate, up to (15.91) we have had globally optimal estimates for the eigenvalue error 
(i.e. almost no information was lost). After (15.91 ) we have started aggressively trading off accu- 
racy for speed. Our theory is such that this can be achieved, in numerous situations, by a simple 
combination of the Galerkin orthogonality condition and any of the "of the shelf" results like the 
those from [|2l|9l| or |[3l[Bl in the singularly perturbed setting. On top of this comes the heuristic 
insight from (TT, Section 3] which indicates that we have properly identified the components 
of the error as given by (11.21 ). It should be noted that our Theorem 13 .31 directly corresponds to 
lfT2l Proposition 2.3], since both are motivated by IfTHl . Furthermore, the estimates from The- 
orems [33] and [3]4] can be combined with [fT5l Theorem 6.1] and the well known identity from 
||23l Ad (v), pp.617] to obtain optimal estimates for the eigenvalue error \\ui — The 
estimates can even be obtained in a situation in which the multiple eigenvalue splits in a cluster 
of eigenvalues. This is a subject of the foUowup report. As a conclusion let us remember the 
remarks 1), 2), 3) from the Introduction. We have introduced a matrix analytic techniques which 
tackle both test vectors outside the domain of definition of the operator and the multiplicity of 
the approximated eigenvalue in a natural and constructive way. Furthermore, Remark [X9l opens 
a way to exploiting other unitary invariant norms for scaling robust eigenvalue estimation. 
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